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Abstract 

We propose an approach to analyze the asymptotic behavior of Polya urns based on the 
contraction method. For this a combinatorial discrete time embedding of the evolution of 
the composition of the urn into random rooted trees is used. A decomposition of the trees 
leads to a system of recursive distributional equations which capture the distributions of 
the numbers of balls of each color. Ideas from the contraction method are used to study 
such systems of recursive distributional equations asymptotically. We apply our approach 
to a couple of concrete Polya urns that lead to limit laws with normal limit distributions, 
with non-normal limit distributions and with asymptotic periodic distributional behavior. 
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1 Introduction 

In this paper we develop an approach to prove limit laws for Polya urn models by the con- 
traction method. We consider an urn with balls of a finite number m > 2 of different colors, 
numbered by 1, . . . , m. The evolution of a Polya urn is determined by an m x m replacement 
matrix R = {aij)i<ij<m which is given in advance together with an initial (time 0) composi- 
tion of the urn with at least one ball. Time evolves in discrete steps. In each step one ball is 
drawn uniformly at random from the urn. If it has color i it is placed back to the urn together 
with aij balls of color j for all j = 1, . . . , m. The steps are iterated independently. A classical 
problem is to identify the asymptotic behavior of the numbers of balls of each color as the 
number n of steps tends to infinity. The literature on this problem, in particular on limit 
laws for the normalized numbers of balls of each color, is vast. We refer to the monographs of 
Johnson and Kotz |l3] and Mahmoud |18j and the references and comments on the literature 
in the papers of Janson [12], Flajolet et al. [10] and Pouyanne [23]. 

A couple of approaches have been used to analyze the asymptotic behavior of Polya urn 
models, most notably the method of moments, discrete time martingale methods, embeddings 
into continuous time multitype branching processes, and methods from analytic combinatorics 



based on generating functions. All these methods use the "forward" dynamic of the urn 
process by exploiting that the distribution of the composition at time n given time n — 1 is 
explicitly accessible. 

In the present paper we propose an approach based on a "backward" decomposition of 
the urn process. We construct an embedding of the evolution of the urn into an associated 
combinatorial random tree structure growing in discrete time. Our associated tree can be 
decomposed at its root (time 0) such that the growth dynamics of the subtrees of the root 
resemble the whole tree in distribution. More precisely we have different types of distributions 
for the associated tree, one type for each possible color of its root. The decomposition of the 
associated tree into subtrees gives rise to a system of distributional recurrences for the numbers 
of balls of each color. To extract the asymptotic behavior from such systems we develop an 
approach in the context of the contraction method. 

The contraction method is well known in the probabilistic analysis of algorithms. It was 
introduced by Rosier [25] and first developed systematically in Rachev and Riischendorf [24] . 
A rather general framework with numerous applications to the analysis of recursive algorithms 
and random trees was given by Neininger and Riischendorf [2QJ . The contraction method has 
been used for sequences of distributions of random variables (or random vectors or stochastic 
processes) that satisfy an appropriate recurrence relation. To the best of our knowledge it 
has not yet been used for systems of such recurrence relations as they arise in the present 
paper, the only exception being Leckey et al. [T7] where tries are analyzed under a Markov 
source model. 

The aim of this paper is not to compete with other techniques with respect to generality 
under which urn models can be analyzed. Instead we discuss our approach at a couple 
of examples illustrating the contraction framework in three frequently occurring asymptotic 
regimes: normal limit laws, non-normal limit laws and regimes with oscillating distributional 
behavior. We also discuss the case of random entries in the replacement matrix. Our proofs 
are generic and can easily be transfered to other urn models or be developed into more 
general theorems when asymptotic expansions of means (respectively means and variances in 
the normal limit case) are available, cf. the types of expansions of the means in section [3j 

A general assumption in the present paper is that the replacement matrix is balanced, 
i.e., that we have X^Jli ^ij ='■ K — 1 for all i = 1, . . . ,m, where K > 2 is a, fixed integer. 
(The notation K is unfortunate since this integer is not random and mainly chosen to have 
similarity in notation with earlier work on the contraction method.) An implication of the 
balance condition is that the growths of the subtrees of the associated tree processes can 
asymptotically jointly be captured by Dirichlet distributions. This leads to characterizations 
of the limit distributions in all cases (normal, non-normal and oscillatory limits) by systems, 
cf. ([6])-([8]) below, of distributional fixed-point equations where all coefficients are powers of 
components of a Dirichlet distributed vector, see also the discussion in section [3j It may be 
an interesting aspect of the present approach that all three regimes are governed by these 
quite similar types of systems of distributional fixed-point equations. 

The paper is organized as follows: in section [2] we introduce the associated trees into 
which the urn models are embedded and derive the systems of distributional recurrences for 
the numbers of balls of a certain color from the associated trees. In section [3] we outline 
the types of systems of fixed-point equations that emerge from the distributional recurrences 
after a proper normalization. To make these recurrences and fixed-point equations acces- 
sible to the contraction method, in section |4] we first introduce spaces of distributions and 
appropriate cartesian product spaces together with metrics on these product spaces. The 
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metrics in use are product versions of the minimal Lp metrics and product versions of the 
Zolotarev metrics. In section [5] we use these spaces and metrics to show that our systems of 
distributional fixed-point equations uniquely characterize vectors of probability distributions 
via a contraction property. These cover the types of distributional fixed-point equations that 
appear in the final section [6] where we discuss examples of limit laws for Polya urn schemes 
within our approach. In section [6] also our convergence proofs are worked out, again based 
on the product versions of the minimal Lp and Zolotarev metrics. 

Notation. By — ^ convergence in distribution is denoted. We denote the normal distribution 
on M with mean /i G M and variance o"^ > by AA(/i, a"^). In the case = this degenerates to 
the Dirac measure in 0. Throughout the Bachmann-Landau symbols are used in asymptotic 
statements. By log(x) for x > the natural logarithm of x is denoted. 



2 A recursive description of Polya urns 

In this section we explain the embedding of urn processes into associated combinatorial ran- 
dom tree structures growing in discrete time. The distributional self-similarity within the 
subtrees of the roots of these associated trees leads to systems of distributional recurrences 
which are the basis of our approach. 

The Polya urn. To develop our approach we first consider an urn model with two colors, 
black and white, and a deterministic replacement matrix i?, where the sums of the entries in 
each row is constant. Below, an extension of this approach to urns with more than two colors 
and replacement matrices with random entries is discussed as well. However, the assumption 
that the sums of the entries in each row is the same is crucial for our method, the reason also 
being explained below. To be definite we use the replacement matrix 

with a,ci e No U {-1} and 6,c G No (1) 
with 



R 



a b 
c d 



a + b = c + d=: K -1>1. 

Hence after drawing a black ball this ball is placed back to the urn together with a new black 
balls and b new white balls. If drawing a white ball it is placed back to the urn together with 
c black balls and d white balls. A diagonal entry a = — 1 (or d = —1) implies that a drawn 
black (or white) ball is not replaced to the urn while balls of the other color are still added 
to the urn. As initial configuration we consider both, one black ball or one white ball. Other 
initial configurations can be dealt with as well, also discussed below. We denote by the 
number of black balls after n steps when initially starting with one black ball, by the 
number of black balls after n steps when initially starting with one white ball. Hence we have 
5^ = 1 and B^ = 0. 

The associated tree. We encode the urn process as follows by a discrete time evolution 
of a random tree with nodes colored black or white. This tree is called associated tree. The 
initial urn with one ball, say a black one, is associated with a tree with one root node of the 
same (black) color. The ball in the urn is represented by this root node. Now drawing the 
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ball and placing it back to the urn together with a new black balls and b new white balls is 
encoded in the associated tree by adding a + b + 1 = K children to the root node, a + 1 of 
them being black and b being white. The root node then no longer represents a ball in the 
tree, whereas the K new leaves of the tree now represent the K balls in the urn. Now, we 
iterate this procedure: At any step, a ball is drawn from the urn. It is represented by one of 
the leaves, say node v in the tree. The urn follows its dynamic. If the ball drawn is black, the 
(black) leaf v gets K children, a + 1 black ones and b white ones. Similarly, if the ball drawn 
is white, the (white) leaf v gets c black children and d + 1 white children. In both cases v 
no longer represents a ball in the urn. The ball drawn and the new balls are represented by 
the children of v. The correspondence between all other leaves of the tree and the other balls 
in the urn remains unchanged. For an example of an evolution of an urn and its associated 
tree see Figure 1. Hence, at any time the balls in the urn are represented by the leaves of the 
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Figure 1 : A realization of the evolution of the Polya urn with replacement matrix [ 2 f ] 



and 



initially one white ball. The arrows indicate which ball is drawn (resp. node is replaced) in 
each step. Below each urn its associated tree is shown. Leaf nodes correspond to the balls in 
the urn, non-leaf nodes (crossed out) do no longer correspond to balls in the urn. However, 
their color still matters for the recursive decomposition of the associated tree. 



associate tree, where the colors of balls and representing leaves match. Each node of the tree 
is either a leaf or has K children. We could as well simulate the urn process by only running 
the evolution of the associated tree as follows: Start with one root node of the color of the 
initial ball of the urn. At any step choose one of the leaves of the tree uniformly at random, 
inspect its color, add K children to the chosen leaf and color these children as defined above. 
Then after n steps the tree has n{K — 1) + 1 leaves, the number of black leaves is distributed 
as B}1 if the root node was black and distributed as B^, if the root node was white. 

Subsequently it is important to note the following recursive structure of the associated 
tree: For a fixed replacement matrix of the Polya urn we consider the two initial compositions 
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of one black respectively one white ball and their two associated trees. We call these the b- 
associated respectively w-associated tree. Consider one of these associated trees after n steps. 
It has n{K — 1) + 1 leaves and each subtree rooted at a child of the associated tree's root 
(we call them shortly only subtrees) has a random number of leaves according to how often 
a leaf node has been chosen for replacement in the subtree. We condition on the numbers of 
leaves of the subtrees to be ir{K — 1) + 1 with ir G No for r = 1, . . . ,K. Note that we have 
X^^i ir = n — 1, the —1 resulting from the fact that in the first step of the evolution of the 
associated tree the subtrees are being generated, only afterwards they start growing. Prom 
the evolution of the b-associated tree it is clear that conditionally on the subtrees' numbers 
of leaves being ir{K — 1) + 1 the subtrees are stochastically independent and the r-th subtree 
is distributed as an associated tree after ir steps. Whether it has the distribution of the b- 
or the w-associated tree depends on the color of the subtree's root node. 

To summarize we have that conditionally on their numbers of leaves the subtrees of as- 
sociated trees are independent and distributed as associated trees of corresponding size and 
type inherited from the color of their root node. 

Growth of subtrees. In our analysis the asymptotic growth of the K subtrees of the 
associated tree is used. We denote by 7*^") = . . . , I^^) the vector of the numbers of 

draws of leaves from each subtree after n draws in the full associated tree. In other words, 
Ir"'\K — 1) + 1 is the number of leaves of the r-th subtree after n G No steps. We have 
I^^^ = (0, ...,0), and /^^^ is a vector with all entries being except for one coordinate 
which is 1. To describe the asymptotic growth of I^^^ we need the Dirichlet distribution 
Dirichlet [{K — . . . , {K — 1)"^): It is the distribution of a random vector (Di, . . . , Dk) 
with J2^=i — 1 such that {Di, . . . , Dk-i) has a Lcbcsgue-density supported by the 
simplex Sk ■■= {{xi, xk-i) G [0, 1]-^"^ | ^.^=1 < 1} given by 

TT ^ ^ r((K-i)-y-^ 

X = [Xi,. . . ,XK-l) ^ CK [[ Xr , XESk, Cr = K -1 ' 

where T denotes the gamma function. In particular, Di,. . . ,Dk are identically distributed 
with the heta{{K — 1)"^, 1) distribution, i.e., with Lebesgue-density 

X {K - l)xT<^ , xG[0, 1]. 
We have the following asymptotic behavior of I^'^^ : 

Lemma 2.1. Consider a Polya urn with constant row sum K —1 >1 and its associated tree. 
For the numbers of balls I*-"^ = . . . ,1^ '^) drawn in each subtree of the associated tree 

when n balls have been drawn in the whole associated tree we have, as n oo, 

( An) An) \ 

\ n n I 

almost surely and in any Lp, where {Di, . . . ,Dk) has the Dirichlet distribution 
£.{01,..., Dk) = Dirichlet 



K-l 



K -1 
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Proof. The sequence {i[^\k—1) + 1, . . . , I^\k —1) + l)n^iq^ has an interpretation by another 
urn model, which we call the subtree-induced urn: For this we give additional labels to the 
leaves of the associated tree. The set of possible labels is {1, . . . , K} and we label a leaf j if 
it belongs to the j-th subtree of the root (any ordering of the subtrees of the root is fine). 
Hence, all leaves of a subtree of the associated tree's root get the same label, leaves of different 
subtrees get different labels. Now, the subtree-induced urn has balls of colors 1,. . . ,K, at 
each time the number of balls of each color is identical with the numbers of leaves of the 
corresponding label. Hence the dynamic of the subtree-induced urn is that of a Polya urn 
with initially K balls, one of each color. Whenever a ball is drawn, it is placed back to the 
urn together with K — 1 balls of the same color. In other words the replacement matrix for 
the dynamic of the subtree-induced urn is a K x K diagonal matrix with all diagonal entries 
being K — 1. After n steps, we have Ir"'\K — 1) -|- 1 balls of color r. The dynamic of the 
subtree-induced urn as a multivariate Polya-Eggenberger urn is well-known, we have 



n{K + 1 n(K-l) + l 



almost surely as n — t- cx) where (Di, . . .,Dk) has the Dirichlet((K - . . . , (iC - l)'^) 
distribution. This implies the assertion. □ 

Note that a Polya urn violating the condition of constant row sums in the replacement 
matrix has an associated tree with an 1^"''^ such that I^^^ /n converges to a degenerate limit, a 
deterministic vector with one 1 and all other components 0. However, a non-degenerate limit 
for /^"^/n will be crucial below. 

System of recursive equations. We set up recursive equations for the distributions of the 
quantities and B^: For B^ we start the urn with one black ball and have a b- associated 
tree with a black root node. Now, B^ is distributed as the number of black leaves in the 
associated tree after n steps which we express as the sum of the numbers of black leaves of 
its subtrees. As discussed above, conditionally on /^"^ = ■ ■ ■ the vector of the 

numbers of balls drawn in each subtree, these subtrees are independent and distributed as 
b-associated trees or w-associated trees of the corresponding size depending on the color of 
their roots. A b-associated tree has a + 1 black and b = K — {a + 1) white children. Hence 
we obtain 

B'n = j:B';ff+ E n>l, (2) 

r=l r=a+2 

where = denotes that left and right hand side have identical distribution, wc have that 
(^fc'''^^)o<fc<n, • • • , {Bl'^''^^^)o<k<n, {B^'''°'^'^^)o<k<n, • • • , {B^'''^^)o<k<n, I^""^ are independent, 
the B^''^''' are distributed as B^, the -B^'*"*^^ are distributed as B"^ for = 0, . . . , n — 1 and the 
respective values of r. 

Similarly, we obtain a recursive distributional equation for B^. We have 

^: = i:^'(i?+ E B-^\ n>l, (3) 

r=l r=c+l 
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with conditions on independence and identical distributions as in Note that with the 
initial value {Bq,Bq ) = (1,0) the system of equations defines the sequence of pairs 

of distributions {C{B^), C{K))r 



)n>0- 



General number of colors. The approach above for urns with two colors extends directly 
to urns with an arbitrary number m > 2 of colors. We denote the replacement matrix by 
R = {aij)i<i,j<m with 



Nn 



for i / j, 



1 NoU{-l}, fori=i, 



and aij =: K — 1 > 1 for i = 1, . 



, m. 



The colors (subsequently also called types) are now numbered 1, . . . ,m and we focus on the 
number of balls of color 1 after n steps. When starting with one ball of color j we denote 
by Bn the number of color 1 balls after n steps. To formulate a system of distributional 
recurrences generalizing ^ and ([3]) we further denote the intervals of integers 



J, 



ij ■- 



with the convention [x, y] = if a; > y. Then, we have 



n No, for i < j, 
nNo, forz = j, 
nNo, fori>j, 



(4) 



1=1 r&Jii 



where, for each j £ {1, . . . , m} we have that the family 



B 



Mr) 



0<k<n 



is independent, i?^*''^^-* is distributed as B^^^ for a ll i G {1, . . . , m}, < k < n and r G Jjj and 



r G Jjj,2 G {1, , 



,m}|u 



(5) 



has the distribution as above in Lemma 2.1 



Random entries in the replacement matrix. The case of a replacement matrix with 
random entries such that all rows sum up to a deterministic and fixed K—1>1 can be covered 
by an extension of the system ([s]). Instead of formulating such an extension explicitly, we 
discuss an example in section 6.2 



3 Systems of limit equations 

We outline how systems of the form ([s]) are used subsequently. Crucial are the expansions of 
the means 



:= E r^l-^'l 



j = l,...,m. 
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We only consider cases where these means grow hnearly. Note however, that even balanced 
urns can have quite different growth orders, an example is the replacement matrix [3 5]) see 
Kotz et al. [T6]. 



Type (a). Assume that we have expansions of the form, as n — )• 00, 



c,,n 



+ djU^ + o{n^), j = 1, . . . ,m, 



with a constant > independent of j, with constants dj G M and an exponent 1/2 < A < 1. 
We call this scenario of type (a). This suggests that the variances are of the order n^'*' and 
a proper scaling is 



n > 1, j = 1, . . . ,m. 



Deriving from ([5j) a system of recurrences for the x]f ' and letting formally n — t- 00 (this is 
done explicitly in the examples in section [6]) we obtain the system of fixed-point equations 



(6) 



i=l rgJi 



where the X^'W and {Di,...,Dk) are independent, the X^'^''^ are distributed as X^, 



(^1 



, Dk) is distributed as in Lemma 



2.1 



and the b^^ are functions of (Di,...,Dk)- 



It turns out that such a system subject to centered X'^^ with finite second moments has a 

ms 
and 



unique solution on the level of distributions (Theorem 5.1). This identifies the weak limits 
Examples are in sections 



of the X 



6.1 



6.2 



One can as well obtain the same system 
(6) with fet-^l = for all j by only centering the Bn^ by c^n instead of the exact mean. Then, 
system ^ has to be solved subject to finite second moments and appropriate means. 



Type (b). Assume that we have expansions of the form, as n — )• 00, 

Cf,n + o{y/n), j = l,...,m, 



Pn 



with a constant > independent of j. We call this scenario of type (b). This suggests 
that the variances are of linear order and a proper scaling is 



X[^1 := 



pn 



Var(5[fl) 



n>l, j = 1, 



. , m 



(or \jYa.i{Bn^) replaced by \/n)- The corresponding system of fixed-point equations in the 
limit is 



i=l r&Jij 



j = l,...,m, 



(7) 



with conditions as in ([6]). Under appropriate assumptions on momen ts w e find that the 
only solution is all Xt-^^being standard normally distributed (T heor em |5.2[) . This leads to 
asymptotic normality of the X^^ . Examples are given in sections |6.l| and|6.2| The case 



[j] 



Cf,n + @{^/n), j = l,...,m, 
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leads to the same system of fixed-point equations ([T]). However, here the variances typically 
are of order nlog''(n) with a positive 5. 

Type (c). Assume that we have expansions of the form, as n — )• oo, 



,m, 



with a constant > independent of j, 1/2 < A < 1, constants Kj £ C and /U G M. (By i 
the imaginary unit is denoted.) We call this scenario of type (c). This suggests oscillating 
variances of the order n"^^. The oscillatory behavior of mean and variance can typically not 
be removed by proper scaling to obtain convergence towards a fixed limit distribution. Using 
the scaling 

Xm:=?l^^, n>l, j = l,...,m. 

it turns out that the oscillating behavior of the Xn can be captured by the system of fixed- 
point equations 

m 

J = !>••• (8) 

i=l r£Jij 

with conditions as in ^ and uj := X + i/i. Under appropriate moment assumptions this has 



a unique solution within distributions on C (Theorem 5.3). An example of a corresponding 
distributional approximation is given in section [6^ 



Note that the approach of embedding urn models into continuous time multitype branching 
processes, see [U [12], also leads to characterizations of the non- normal limits as in (|6]) and 
([s]). However, the form of the fixed-point equations is different, see the system in equation 
(3.5) in Janson |12j . Properties of such fixed-points have been studied in Chauvin et al. j6i[7|. 



4 Spaces of distributions and metrics 



In this section we define cartesian products of spaces of probability distributions and metrics 
on these products. These metric spaces will be used below to first characterize limit distri- 
butions of urn models (section [s]) and then prove convergence in distribution of the scaled 
numbers of balls of a color (section [6]) . 



Spaces. We denote by Ai^ the space of all probability distributions on 
c7-field. Moreover, we consider the subspaces 



with the Borel 



Mf:=\^C{X)€M^\E[\X\']<oo^, s > 0, 
Mfi^i):={ciX)eMf\E[X]=fiY s>l,/iGM 
MfijM, (j2) := |£(X) G Mfifi) I Var(X) = f^^} , s > 2, /i G M, a > 0. 



(9) 
(10) 
(11) 
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We need the d-fold cartesian products, d G N, of these spaces denoted by 

Mfy^ :=Mf X ■■■ xMf, (12) 

and analogously {Mf{fi))^'^ and {Mf{n,a'^))^'^. 

We also need probability distributions on the complexe plane C. By A4'^ the space of all 
probability distributions on C with the Borel cr-field is denoted. Moreover, for 7 G C we use 
the subspaces and product space 

:= {ciX) eM^\E[\X\']<oo'} , s > 0, 

-^2(7) ■.= {c{X)eM^\E[X]=^Y 

(-M2(7))"':=-Mf(7)x---xX^(7). 



To cover the different behavior of the urns two types of metrics are constructed, extensions 
of the Zolotarev metrics Cs and the minimal Lp-metric ip to the product spaces defined above. 

Zolotarev metric. The Zolotarev metric has been introduced and studied in Zolotarev 
|30|, [3T] . The contraction method based on the Zolotarev metric was systematically developed 
in |20) and, for issues that go beyond what is needed in this paper, in [15] and [21]. We only 
need the following properties: For distributions C{X), C{Y) G A4^ the Zolotarev distance Cs, 
s > 0, is defined by 

Cs{X,Y):=UC{X),C{Y)):= sup |E[/(X) - /(y)]| (13) 

where s = m + a with 0<a<l,mGNo, and 

:= {/ G C^{R,R) : \f^^\x) - f^"'\y)\ < \x - yH, (14) 

the space of m times continuously differentiable functions from M to M such that the m-th 
derivative is Holder continuous of order a with Holder-constant 1. 

We have that CsiX, Y) < 00, if all moments of orders 1, . . . , m of X and Y are equal and if 
the s-th absolute moments of X and Y are finite. Since later on the cases 1 < s < 3 are used, 
we have two basic cases: First, for 1 < s < 2 we have Cs{X, Y) < 00 for C{X), C{Y) G A4f(fi) 
for any ^ G M. Second, for 2 < s < 3 we have Cs{X, Y) < 00 for C{X), C{Y) G Mf{fi, a^) for 
any // G M and a >0. Moreover, the pairs {Mf {fi),Cs) for 1 < s < 2 and {Mfin, cr^), Cs) for 
2 < s < 3 are complete metric spaces; for the completeness see [SJ Theorem 5.1]. 

Convergence in implies weak convergence on M. Furthermore, Q is (s, +) ideal, i.e., we 
have 

Cs{X + Z,Y + Z)<Cs{X,Y), Cs{cX,cY) = cXs{X,Y) (15) 



for all Z being independent of {X,Y) and all c > 0. Note, that this implies for Xi, . . . ,Xn 
independent and Yi, . . . ,Yn independent such that the respective Cs distances are finite that 



^ E ^0 - E CsiX^,Y). (16) 



\i=l i=l / i=l 



10 



On the product spaces {Mf{^i)Y'^ for 1 < s < 2 and (A^f^(/x, ct^))^"^ for 2 < s < 3 our first 
main tool is 

C^{{ui,...,Vd),{ni,...,Hd)) ■■= raax Cs{i^j,fij), (17) 

where (z^i, . . . , i^d), (/^i, . . . , /U^) e Mfi/J.))^'^ and G (A^^(/U, cr^))^'^ respectively. Note that 
is a complete metric on the respective product spaces and induces the product topology. 

Minimal Lp-metric Ip. First for probability metrics on the real line the minimal Lp-metric 
ip, 1 < p < oo is defined by 

£p{iy, g) := inf{||y - W\\p \ C{V) = u, C{W) = g}, u, g G M^, 

where \\V - W\\p := (E[\V - W\p])^/p is the usual Lp -norm. The spaces {A4p,ip) and 
{Mp{^i),ip) for 1 < p < oo are complete metric spaces, see [5j. The infimum in the defi- 
nition of ip is a minimum. Random variables V , W with distributions v and g respectively 
such that £p{i^, g) = \\V' — W'\\p are called optimal couplings. They do exist for all u, g £ A4j'. 
We use the notation ip{X, Y) := ip{C{X) , C{Y)) for random variables X and Y. Subsequently 
also the following inequality between the £p and Cs metrics is used: 

Cs(x,y) < ((E[|x|^])i-i/^ + (E[|yr])i-i/^)4(x,y), i<s<3, (is) 

where, for 1 < s < 2, we need C{X), C{Y) € ^Af{|J.) for some E M and, for 2 < s < 3, we 
need C{X),C{Y) G Mf{fi, a"^) for some ^ e M and fi > 0, see [SI Lemma 5.7]. 
On the product space {Jv(^{{)))^'^ we define 

iiiivi, ...,Ud),{gi,..., gd)) ■■= max/2(^'j, Qj), 

where {ui, . . .,Ud), {fii, . . ■ , Hd) & (A^f (0))'^"'. Note that (A^f (0))'^°', ^^) is a complete metric 
space as well. 

Second, on the complex plane the minimal L^-metric ip is defined similarly by 

(piu, g) := mf{\\V - W\\p \ C{V) = u, £{W) = g}, u, g G M^, 

with the analogous definition of the Lp-norm. The respective metric spaces are complete as 
in the real case and optimal couplings exist as well. On the product space (A^2 (0))^*^ we use 

£^((z^i,...,i^rf),(£»i,...,£»rf)) := max i2{iyj,gj), 

where (z^i, . . . ,i'd), il^i, ■ ■ ■ £ (A^2 (O))^"'- ^ote that (A^^(O))^"', ^^) is a complete metric 
space as well. 

Preview on the use of spaces and metrics. The guidance on which space and metric to 
use in which asymptotic regime of Polya urns is as follows. We come back to the three types 
(a)-(c) of urns from the previous section: 

(a) Urns that after scaling lead to convergence to a non-normal limit distribution. Typi- 
cally such a convergence holds almost surely, however we only discuss convergence in 
distribution. 
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(b) Urns that after scaling lead to convergence to a normal limit. Such a convergence 
typically does not hold almost surely, but at least in distribution. 

(c) Urns that eben after a proper scaling do not lead to convergence. Instead there is an 
asymptotic oscillatory behavior of the distributions. Such oscillatory behavior can even 
be captured almost surely, we discuss a (weak) description for distributions. 

The cases of type (a) can be dealt with on the space (A^f (/i))^'' with appropriate /i G M 
and d G N, where, by centering, one can always achieve the choice // = 0. One can either use 
the metric or ^2 which lead to similar results, although based on different details in the 
proofs. We will only present the use of C2 1 since we do not see any advantage of £2 here. 

The cases of type (b) can be dealt with on the space {A4^{fi, cj^))^'^ with 2 < s < 3 and 
appropriate fi G M, a > and d £ N. By normalization, one can always achieve the choices 
11 = and (7 = 1. Since in the context of urns third absolute moments in type (b) cases 
typically do exist, one can use s = 3 and the metric C3 ■ We do not know how to use the ip 
metrics in type (b) cases. 

The cases of type (c) can be dealt with on the space (A^2 (t))^*^ with appropriate 7 G M 
and d € N. The metric of choice for the type (c) cases is the complex version of £2- In our 
example below we will however use (7i) x • • • x ^A2{'yd) with 71, . . . , 7^ G C to be able to 
work with a more natural scaling of the random variables, the metric still being ^2 • 



5 Associated fixed point equations 



We fix d, G N, a d X d' matrix (Air) of random variables and a vector (61,..., 6^) of 
random variables. Either all of these random variables are real or all of them are complex. 
Furthermore, we are given a. d x d' matrix {n^ir)) with all entries n{ir) G {1, . . . ,d}. First, 
we consider the case, where all Air and all bi are real. We associate a map 



xd 



xd 



T : i^M' 



(19) 
(20) 



with {A ii, • • • , Aid' , bi), Zii, . . . , Zid' independent, and Zir distributed as iJ,-^(^ir) for r — 1, . . . ,d' 
and all components i = 1, . . . , d. 

In the case, where the Air and bi are complex random variables, we define a map T' similar 
to T: 



r : (m 



xd 



xd 



(21) 



(/ii,...,/id) ^ (r{(^i,...,/id),...,r^(/ii,...,^rf)) 



with T/(//i, . . . , fid) defined as for Tj in (20). 

For the three regimes discussed in the preview within section [4] we use the following three 
theorems (Theorem 5.1 for type (a). Theorem 5.2 for type (b), and Theorem 5.3 for type (c)) 
on existence of fixed-points of T and T' . 
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Theorem 5.1. Assume that in the definition ofT in (19) and (20) the Air cLnd hi are square 
integrable real random variables with = for all 1 < i < d and 1 < r < d' and 

d' 

^^E[Ai]<l. (22) 



max 

l<i<d- 



r=l 



Then the restriction of T to (7^2(0))^ ^^-^ ^ unique fixed-point. 



Theorem 5.2. Assume that in the definition of T in (19) and (20) for some e > the Ai 
are L2+e -integrable real random variables and hi = for all 1 < i < d and 1 < r < d' , that 



Afr = 1 /O'^ ^ = 1; 



,d 



and 



r=l 



mm i 

l<i<d 



max \Air\ < 1 I > 0. 

l<r<d' 



(23) 



(24) 



Then, for all a > 0, the restriction of T to {Ai2^^{0,a ))^ has the unique fixed-point 
(AA(0,a2),...,AA(0,a2)). 



Theorem 5.3. Assume that in the definition ofT' in (21) the Air o-nd bi are square integrable 
complex random variables for all 1 < i < d and 1 < r < d' and that for 71 , . . . , 7^ E C we 
have 



l,...,d. 



(25) 



// moreover 



d' 



max 

l<i<d 



< 1 



(26) 



r=l 



then the restriction ofT' to (71) x • • • x (7d) ^(^^ unique fixed-point. 



Note that a special case of Theorem 5.1 was used in the proof of [12^ Theorem 3.9 (iii) 



with a similar proof technique as in our proof of Theorem 5.3 



The rest of this section contains the proofs of Theorems |5 . 1 1 - [5 . 3| 



Proof. (Theorem 5.1). First note that for (//i, . . . , ^u^) G {A4^{0))^'^, by independence in 
definition (20) and E[6j] = we have Tj(//i, . . . , /dd) £ A^f (0) for i = 1, . . . , d. Hence, the 
restriction of T to (A^f(O))^'^ maps into (A^f(O))^'^. 

Next, we show that the restriction ofT to (7W^(0))^'^ is a (strict) contraction with respect 
to the metric (2^: For {fii, . . . (z^i, ■ ■ ■ ,Vd) ^ (A^flO))^'^ first fix i e {1, . . . Let 
Zi\, . . . , Zid' and Z[i, . . . , Z-^, be real random variables such that Zir is distributed as /U^(jr) 
and Z[^ is distributed as v-i^^ir)- Moreover, assume that both families {{An, . . . ., Ai^i ,bi), 
Zji, . . . , Zid'} and {{An, . . . , Ai^i, bi), Z[^, . . . , Z'-^,} are independent. Then we have 



Ti(//i, . . . , ^d) = £ ^ AirZir +bi \ , Ti{vi, .... 



^d) 



c(j2AirZ'ir + b}j . (27) 
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Conditioning on {An, . . . , Ai^', hi) and denoting this vector's distribution by T we obtain 

C2(7i(/Wl, • • • , fid),Ti{ui, . . . , l/rf)) 



sup 



f f d' d' \ 

Qi,...,arf/,/3) 

\r=l r=l / 



< / sup 



dT(ai,...,Q(i/,^) 



(28) 



Since C2 is (2, +)-ideal, we obtain from (15) that C2{Y1 a-rZir+jS, arZ[^+l3) < ^ a^C2(-^jr, Z[^). 
Hence, we can further estimate 

C2(7i(/ii, . . .,fid),Ti{iyi, . . .,Vd)) 
„ d' 

- / '^(4C2iZir,Z-^)dT{ai,...,ad',/3) 

r=l 
„ d' 

= / arC2ilJ-n(ir),^niir)) dT{ai,..., Ud' , P) 

< |^^E[4,]^C2((^i,...,/i,),(i/i,...,i/rf)). (29) 
Now, taking the maximum over i yields 

C2^(r(/.i,...,/.rf),T(z.i,...,i/^)) < f max Ve[A2,]^ C2 ((/^i, . . . , /x^), (i^i, . . . , z.^)). (30) 



Hence, condition (22) imphes that the restriction of T to {M^i^))^ is a contraction. Since 
the metric ^2 is complete, Banach's fixed-point theorem implies the assertion. □ 



Proof. (Theorem 5.2). This proof is similar to the previous proof of Theorem 5.1 Let 



e > be as in Theorem 5.2 and o" > be arbitrary. First note that for (fii, . . . , fj,^) G 
(Alf^_^(0, (7^))^'^, by independence in definition (20), condition (23), and 6j = we have 
Tiifii, ...,fid) -^2+e(0' f^^) for i = 1, . . . , d. Hence, the restriction of T to {Mf^^{0, cr^))^'^ 
maps into (Xf+,(0, fj^))^^. 

We set s := (2 + e) A 3. For (^1, . . . , ^rf), (i^i, . . . , z^^) G {M^_^_^{0,a'^))^'^ we choose 
Zii, . . . , Zid' and Z--^^, . . . , Z-^, as in the proof of Theorem 5.1 such that we have (27). Note 
that with our choice of s we have Cs(rj(/ii, . . . , fid),Ti{ui, . . . , i^d)) < 00. With an estimate 



analogous to (28) - (30), using now that is (s, -|-)-ideal, we obtain 



CsiT{fJ'i,---,tJ-d),T{iyi,...,Ud 



)) ^ fmax Cs{{f^i,---,f^d),{iyi,..-,iyd)). 
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Note that s > 2 and the conditions ^ and ([24|) imply that ErLi IE[|-4ir-|1 < 1 for all i = 
1, . . . , d. Hence, the restriction of T to (7Wf^^(0, cj^))^'^ is a contraction and the completeness 
of Cs implies the existence of a unique fixed-point. With the convolution property M{0, af) * 
Af{0, o\) = AA(0, al + al) for cJi, £72 > one can directly check that {N{Q, ci^), . . . , A/'(0, a"^)) 
is a fixed-point of T in (A^f+,(0, a'^)Y'^. □ 



Proof. (Theorem 5.3). Let 71, . . . , be as in Theorem 5.3 and abbreviate V := (71) x 
• • • X A^2(7d)- First note that for (^ui, . . . , fi^) G ^ from independence in the definition of 
r/(/Lii, . . . , /irf) and the finite second moments of the Air and bi we obtain T/(;Ui, . . . , //rf) € 
for alH = 1, . . . , d. For a random variable W with distribution T/(;Ui, . . . , ^u^) we have 

d' 

r=l 



by condition (25). Hence, the restriction of T' to V maps into V. 

Next, we show that the restriction of T' to is a contraction with respect to the metric ^2 • 
For (^1, . . . , //rf), (i/i, . . . , Ud) G V we first fix z S {1, . . . , d}. Let {Zir, Z'-^) be an optimal cou- 
pling of and for r = 1, . . . , d' such that {Zn, Z'-^),..., {Zi^' , Z'-^,), {An, Ai^'M) 
are independent. Then we have 



...,fld)=C ^irZir + bi^ , Tl{ui, . . . , Ud) = C AirZ[^ + b-^ 



(31) 



Denoting by 7 the complex conjugate of 7 G C we obtain 

eliTlim, . . . , fid),Tl{ui, . . . ,Ud)) 
d' 

^ ^ Air{Zir — Zj^j. 



< E 



r=l 
d' 



^tr I I Zir 



+ E 



Air{Zir — Z[^)Ait{Zit — Z'j^^ 



El-* 

r=l 

= [l^irP] ^i(Ai7r(ir),i^7r(ir)) 

r=l 



(32) 



For equality (32) we firstly use that Zij. — Z[^ and Za — Z[^ are independent, centered factors, 
so that the expectation of the sum over r 7^ t is and secondly that [Zi^^Z^^ are optimal 



couplings of (/U,r(ir)j i^7r(ir)) such that IE[|Zjr — Z\^^\ 
Now, taking the maximum over i yields 



£^(r'(/xi, . . . , iid).T\vr, ...,Vd))< f max Ve {\A„\^\\ 



^2(Mvr(ir)) ^7r(ir)) 
1/2 



£2 ((/"I, • • • ,/"d), (z^i, • • • ,l^d))- 



Hence, condition (26) implies that the restriction of T' to "P is a contraction. Since the metric 
^2 is complete, Banach's fixed-point theorem implies the assertion. 



□ 
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6 Convergence and examples 



In this section a couple of concrete Polya urns are considered and convergence of the normal- 
ized numbers of balls of a color is shown within the product metrics defined in section |4j The 
proofs are generic such that they can easily be transferred to other urns of the types (a)-(c) 
in section [Sj We always show limit laws for the initial compositions of the urn with one ball 
of (arbitrary) color. Limit laws for other initial compositions can be obtained from these by 
appropriate convolution with coefficients which are powers of components of an independent 
Dirichlet distributed vector. We leave the details to the reader. 



6.1 2x2 deterministic replacement urns 

A discussion of urns with a general balanced 2x2 replacement matrix as in ([T]) is given in 
Bagchi and Pal Subsequently, we assume the conditions in ([T]) and, as in pj, that be > 0. 
As shown in [2] asymptotic normal behavior occurs for these urns when a — c < {a + b)/2 
(type (b) in section |4]) , whereas a — c > {a + b)/2 leads to limit laws with non- normal limit 
distributions (type (a) in section |4]). In this section we show how to derive these results 
by our contraction approach. With and as in the beginning of section [2] we denote 
expectations by /Ub(n) and ^J-win-). These values can be derived exactly, see [2], 



IJ-h{n) = —^ -n+ — \^ \^ + -r ' (33 

b + c (6 + c)r(l±f^) r(n+^) b + c 



^wn = \ ' n ^ ^ V ^+ 34 



a+b I \ a+b 

Non-normal limit case. We first discuss the non-normal case a — c> (a + b)/2. Note that 
with A := (a — c)/(a + 6) and excluding the case be = 0, we have 1/2 < A < 1 and, as n — t- 00, 

/Xb(n) = Cbn + dbn"*" + o(n^), ii„{n) = Cy^n + dv,n^ + o{n^) (35) 



with 



c{a + b) br[^, cT^^ 
Cb = Cw = ^— , 4 = V-^ V' = V- (36) 



+ c)r(^) {b + c)T 

We use the normalizations Xq := Yq and 

X„:=^^^^, y„:= ^"-^("\ n>l. (37) 

Note that we do not have to identify the order of the variance in advance. It turns out that it 
is sufficient to use the order of the error terms db^^ and d^ri^ in the expansions (35). From 



the system we obtain for the scaled quantities X„, y„ the system, for n > 1, 



r=a+2 




^ / {n) 



r=c+l 



n 



;Vn)+^w(n), (39) 
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with 



a+l / (n) \ ^ \ K / („) \ A 

Mn) = 4ERr -1+ E ^ 



(40) 



(41) 



r=l 



r=c+l 



with conditions on independence and identical distributions analogously to ^ and ([3]). In 
view of Lemma |2.1| this suggests for limits X and Y of X„ and 1^ respectively 



a+l 



r=l r=a+2 



(42) 
(43) 



r=l 



r=c+l 



with 



/ a+l \ 

6b = 4 -1 + I^M + C^w X] ^ 

\ r=l / r=a+2 

c / K 



A 

r ) 



r=l 



r=c+l 



where (Di,..., Dk), X'^^\ X^-^\ Y(^\ y(^) are independent, and the are dis- 
tributed as X, the Y^^^ are distributed as Y and (-Di, . . . , Dk) is as in Lemma 2.1 Note that 
the moments E[Z)^] and the form of db and d„ in (36) imply E[6b] = IE [few] = 0. From A > 1/2 
and J2^=i -Dr = 1 we obtain 



< 1. 



Hence, Theorem 5.1 applies to the map associated to the system (42)-(43) and implies that 
there exists a unique solution (£(Ab), £(Aw)) in the space (0) x Mf{0) to (|42|-([43|. The 
following convergence proof resembles ideas from Neininger and Riischendorf |20j . 

Theorem 6.1. Consider the Polya urn with replacement matrix with a — c > {a + b)/2 
and be > and the normalized numbers Xn and Yn of black balls as i n (fj'Tj )- Furthermore 
let (/:(Ab),£(Aw)) denote the in A^2(0) ^ -^2(0) unique solution of Then, as 

n — )• 00, 



C2^((x„,y„),(Ab,Aw))^o. 



In particular, as n —)• 00, 



Xn > Ab, Yn > Aw 



(44) 
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Proof. We first define, for n > 1, the accompanying sequences 

if 
n 

r(") 
n 



a+l 
r=l 



K /j(n)\ 
r=a+2 \ / 



Al'^ + 6b(n), 



Ai;')+6w(n), 



(45) 
(46) 



r=l 



r=c+l 
A Or) 



with hb{n) and b^{n) as in (40) and the A^^*" , and /^"^ being independent, where the 



K^^'^ are distributed as Ab and the A^"" are distributed as A„ for the respective values of r. 
Note that Q\ and are centered with finite second moment since £(Ab),/2(Ab) G M^[Q). 
Hence, C,2 distances between X„, y„, Qjj, Q^, Ab and A„ are finite. To bound 

A(n) :=C2^((X,,yn),(Ab,A„)) 

we look at the distances 

Ab(n) := C2(A:„, Ab), A„(n) := C2{YnA^)- 
We start with the estimate 

C2(^n,Ab) < UXn,Ql) + C2{Ql,K)- (47) 

We first show for the second summand in the latter display that C2(Qni Ab) — )• as n — )• oo: 
With inequality (18) we have 

C2(Q^Ab)<(||Q:j||2 + ||Ab||2)^2(g^,Ab). 

Moreover, ||Ab||2 < oo since >C(Ab) G and, by definition of Q\ and with \Ir^'^ /n\ < 1 we 
have that ||(5jj||2 is uniformly bounded in n. Hence, it is sufficient to show £2(Qn;Ab) 0. 
We have, using the independence properties in (45) and (42), that 

^2(Q^,Ab) 



a+l 



r=l 



' An) \ ^ 
It \ 



n 



Di 



lA^II 

Au 2 



K 



Lemma 

6b||2^ 



2.1 



2 

(n) 



r=a+2 



n 



in)' 



n 



D. 



||AM||2 + ||6,(n)-6b||2. 



implies that ||(/r"V^)'^ ~ -^rlb — as n — t- oo, which as well implies ||6b('^) 



0. Hence, we obtain ^2(Qn, Ab) ^ and C2{QnM ^ 0. 
Next, we bound the first summand C,2{Xn,Q^) in (47). We condition on 1^"^ Note that 
conditionally on /("^ we have that b\y{n) is deterministic, which, for integration, we denote by 
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/3 = /3(/'^")). Denoting the distribution of /^"^ by T„ and i := {ii, . . . jix) this yields 

» /a+l / • N A 

c.(x„,q::)</c.(^E(^) E 

a+l 

E 



r=l 



r=a+2 



n ■ 



r=a+2 



a+l X 2A 



r=l 
a+l 

r=l 
K 

<EiE 

r=l 



C2(XM,A«)+ E (^) C2(yr'AM)dT„(i) (48) 



n 



AW+/3 dT„(i) 



7^ 

n 



2A 



Ab(4")) 



r=a+2 

is: 

+ E ^ 

r=a+2 



2A 



(n)' 



2A 



n 



An)' 



n 



2A 



A(4")) 



where, for (48) we use that ^2 is (2, +) ideal, as well as (16). Altogether, the estimate started 



in (47) yields 



K 



Ab(n) <ElE 



r=l 



2A 



n 



A(4")) 



+ 0(1). 



With the same argument we obtain the same upper bound for A^{n). Thus, using also that 
, . . • , Ix'^ are identically distributed we have 



A(n) < KE 



n 



2A 



A(/i")) 



+ o(l). 



(49) 



Now, a standard argument implies A(n) — )• as follows: First from (49) we obtain with 
/{"V" Di in L2 and, by A > 1/2, with d := KE[Dl^] < 1 that 

2A1 



A(n) < KE 



I 



in)- 

n 



max A(A;) + oil) 

0<fc<n-l 



<(^ + o(l)) max A(A:) + o(l). 

0<A;<ra— 1 

Since -& < 1 this implies that the sequence (A(n))„>o is bounded. We denote rj := sup„>o A(n) 
and ^ := limsup„_j.oo A(?i). For any e > there exists an no > such that A(n) < ^ + e for 



all n>no. Hence, from (49) we obtain 

2A' 



A(n) < KE 



•q + KE 



^{'l"'>«o} \^ ) 



2A 



(e + e) + o(l). 



With n — )• 00 this implies 



e<^?(C + e). 



19 



Since i) < 1 and e > is arbitrary this implies ^ = 0. Hence, we have C2 {{Xn, Yn), (Ab, A^)) — )■ 
as n — )• 00. Since convergence in ^2 iniphes weak convergence, this imphes (|44|) as well. □ 



The normal limit case. Now, we discuss the normal limit case a — c < (a + b) /2, where we 
first consider a — c < {a + b)/2. (The remaining case a — c = (a + 6)/2 is similar with more 
involved expansions for the first two moments.) The formulae (33), ( |34[ ) now imply 

(50) 



/ib(n) = Cbn + o(vn), ii„{n) = c„n + o{^/n) 



with Cb and Cw as in (36). As it is usual in the use of the contraction method for proving 
normal limit laws based on the metric ^3 we need also an expansion of the variance. We 
denote the variances of and by cr^in) and o''^{n). Additionally to 6c = we exclude 
the case a = c. (In this case there is a trivial non-random evolution of the urn). From [2] we 
have as n — 7- 00: 



(^lin) = fhn + o{n), crl{n) = Un + o(n), 



(51) 



with 



/b = /» 



bc{a 



> 0. 



(a + 6-2^)(a + 6)(6 + c 

We use the normalizations Xq := Yq := Xi := li := and 

_ B"^- Hb{n) __ B^ - fi^jn) 

fTb(n) <7w(n) 

From the system ([2])-([3]) we obtain for the scaled quantities X„, Yn the system, for n > 1, 



n>2. 



(52) 



Xn 



E 



1 Mn) 



(^w{Ir )^(r) 



a+1 _ ,j(n)^ ^ ^ ^ „ (T^")^ 

K ,Jn)- 



E 



<7b(-^i 



r=a+2 



(n) 



5^ ;Ueb(n), 



1 (^h{n) 



r=c+l 



(n) + 



(53) 



(54) 



with conditions on independence and identical distributions analogously to ^ and We 
have ||eb(n)||oo, ||ew('i-)||oo — ^ since the leading linear terms in the expansions (50) cancel 
out and the error terms o{^/n) are asymptotically eliminated by the scaling of order l/i/n. 
In view of Lemma 2.1 this suggests for limits X and Y of Xn and Yn respectively 

a+1 



X = J2 y^rX^''^ + 

r=l 

c 

Y = Y, Vd'vX^'''^ + 



r=l 



K 

E 

r=a+2 
K 

E 

r=c+l 



(55) 



(56) 



where (Z?i, . . . , i^i^), X^^), . . . , X'^^\ Y^^\ y(^) are independent, and the are dis- 
tributed as X and the Y^"^^ are distributed as Y. To the map associated to the system 
(55)-(56) we can apply Theorem 5.2, The conditions (23) and (24) are trivially satis- 
fied. Hence (A/^(0, 1), A/'(0, 1)) is the unique fixed-point of the associated map in the space 
Mf{0,l) xMf{0,l)- 
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Theorem 6.2. Consider the Polya urn with replacement matrix ^ with a — c < {a + b)/2 
and bc> and the normalized numbers X„ and Yn of black balls as in (52). Then, as n ^ oo, 

C3^((x„,y„),(AA(o,i),AA(o,i)))^o. 

In particular, as n ^ oo, 

Xn^Af{0,l), y„AAA(0,l). (57) 

Proof. The proof of this Theorem can be done along the hnes of the proof of Theorem |6.1[ 
However, more care has to be taken in the definition of the quantities corresponding to 
and in (45) in order to assure finiteness of the (3 distances. A possible choice is, for n > 2, 



a+l 



>"{4"'>2} ab(n) 



CTw(/i 



r=a+2 
K 



-Nr + eb(n). 



-Nr + ew(ra). 



(58) 
(59) 



with eb(?T-) and ew(?T-) as in (53)-(54) and A'^i, . . . ,Nx, independent, where the Nr are 
standard normally distributed for r = 1, . . . , K . A comparison on the definition of Q}^ and 
Qn with the right hand sides of (53) and (54) and the scaling (52) yields that we have 
E[Q^] = E[QZ] = and Var(g^) = Var(Q;^) = 1 for all n > 2. Obviously, we also have 
IIQnIls) IIQn lis < Hence, ^3 distances between X„, Yn, Q^, QZi ^^'^ -^(0) 1) a-re finite for 
all n > 2. With 



A(n) 
Ab(n) 
Aw(n) 



C3^((X„,y„),(AA(0,l),AA(0,l))), 

C3(X„,AA(0,1)), 

C3(y„,AA(0,l)) 



we also start with 



C3{Xn,M{0, 1)) < CsiXn, Ql) + CsiQtMiO, 1)). 



Analogously to the proof of Theorem 6.1 we obtain C3(Qn)-^(0, 1)) — a s n — )■ 00 

The bound for C,3{Xn,Qn) is also analogous to the proof of Theorem 
used that Cs is (3, +) ideal instead of (2, +) ideal. This yields 



6.1 



where now is 



a+l 



C3(^n,Qlj)<^E 



r=l 



Mir) 

(Tb(n) 



A(4")) 



K 



+ E IE 

r=a+2 



o■w(?^) 



A(4' 



Then we argue as in the previous proof to obtain analogous to ( |49 ) 



r=l 



C7b("') 



+ E IE 

r=a+2 



'^w(/^"^) 



A(4")) 



+ o(l). 
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From this estimate we can deduce A(n) — )• as for A(n) in the proof of Theorem 6.1, where 

th( 



we need to use that from the expansions (51) and Lemma 2.1 we obtain, as n — )• oo, that 

3^ " 



a+l 
r=l 



K 

+ E E 

r=a+2 



in), 



<7w(n) 



r=l 



< 1. 



(60) 



□ 



Remarks. (1) Note that the proof of Theorem 6.2 cannot be done in the metric since 

crb(/r ; 



the term corresponding to (|60|) then is 



■r=l 



fTb(n) 



+ E ^ 

r=a+2 



1, 



where a limit < 1 is required to obtain A(n) — t- 0. This is the reason, why we use C3 • It is 

possible to use C for any 2 < s < 3 leading to the hmit Ylr=i ^W] < 1- 

(2) The case a — c = {a + h)/2 differs in the error terms in (50) which then become 0{^/n). 



Since the variances in (^51^ get additional logarithmic factors we still obtain the system (55) 



(56) and our proof technique can be applied as well. 



(3) The condition 6c > cannot be dropped. In our approach this would lead to degenerate 
systems of limit equations that do not identify limit laws. It is known that under 6c = 
different asymptotic behavior appears. 

6.2 An urn with random replacements 

As an example for random entries in the replacement matrix R we consider a simple model 
with two colors, black and white. In each step when drawing a black ball, a coin is inde- 
pendently tossed to decide whether the black ball is placed back together with another black 
ball or together with another white ball. The probability for success (a second black ball) is 
denoted by < a < 1. Similarly, if a white ball is drawn a coin with probability < /3 < 1 is 
tossed to decide whether a second white ball or a black ball is placed back together with the 
white ball. We denote the replacement matrix by 



R 



Fa 

1-Fp 



(61) 



where Fa and Fp denote Bernoulli random variables being 1 with probabilities a and /3 
respectively, otherwise 0. This urn model has been introduced in the context of clinical trials 
and been studied together with generahzations in [2811291 [2711261 [T9l [3l lil fT2] . 



The row sums of R in (61) are both deterministically equal to one, hence the urn is 



balanced. Again, the number of black balls after n draws starting with an initial composition 
with one black ball is denoted by B]^, when starting with a white ball by S™. According to 
our approach in section [2] we obtain the recursive equation 



II' In Jn 



+ il-Fa)BJ. n>l. 



(62) 



B 



b,(2)- 



where (-B^'*-^'')o<fe<n, ' 

are distributed as B^ for k 



0<k<m {B^ 
0,... 



'0<fc<n, 



b,(l) 



Fa and /„ are independent, and B, and 



n- 



1, and /„ is uniformly distributed on {0, 
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1} while Jn ■= n—l—In- (The uniform distribution of I„ follows from the uniform distribution 
of the number of balls in the [J 5]-P61ya urn.) Similarly, we obtain for that 



b:! = B 



w,(l) 



+ FgB 



w,{2) 



n > 1, 



(63) 



initial value {Bq,Bq) = (1,0) the system of equations (62)-(63) again defin es the sequence 
of pairs of distributions {C{B^) , C{B^))n>o- As a special case of Lemma 



2.1 



with conditions on independence and identical distributions similar to (62 ). Together with the 

les the se 
we have 

{In,Jn)^{U,l-U) ,(n^oo), (64) 
almost surely where U is uniformly distributed on [0, 1]. Furthermore, we denote for n > 

^b(n) :=E[i?^], ^„(n) :=E[B-]. (65) 
These means have been studied before. We have the following exact formulae: 



Lemma 6.3. For fihin) and ^^{n) as in (65) with < a,/3 < 1 we have 



1-/? 
2-a- /3 
1-/3 



n + 



n 



1 — a r(n + a + /3) 
2 - a- {3T{a + l3)T{n + r 
1-/3 r(n + a + /3) 



+ 



+ 



1-/3 
2-a- f3' 
1-/3 



(66) 
(67) 



2-a-(3'~ 2-a-/3r(a + /3)r(n + l) ' 2-a-/3' 
Proof. A proof is based on matrix diagonalization and can elementary be done along the lines 



of the proof of Lemma 6.7 below. 



□ 



As in the example from section 6.1 we have two different types of limit laws, with normal 
limit for a + /3 < 3/2 and with non- normal limit for a + (3 > 3/2. 



The non-normal limit case. We assume that A := a + /3 — 1 > 1/2. From Lemma 6.3 
obtain the asymptotic expressions, as n — )• oo. 



we 



with constants 



/ 



1-/3 



« 1- A' 
We use the normalizations Xq 



d'h 



1 



a 



(i-A)r(A + i)' 

Yo := and 



dL 



1-/3 



:i-A)r(A + i) 



n> 1. 



(68) 



(69) 



As in the non- normal case of the example in section 6.1 it is sufficient to use the order of the 



error term of the mean for the scaling. From (62)-(63) we obtain for n > 1, 

A / T \ A 



X„ 



J, 



J, 



4? + (1 - Fc.) 



yP + (1 



n 
n 



A 



Yjn+bM, 



(70) 
(71) 
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with 



6'b(n) 



b'^{n) = d[ 




1 + 4(1 - F„ 



J, 



+ 0(1), 



+dUl-F,s) (^)' + o(l), 



with conditions on independence and identical distributions analogously to (62)-(63). In view 
of ( 64 ) this suggests for limits X and Y of Xn and Yn that 

X = + Fail - c/)^x(2) + (1 - Fa) (1 - c/)^y(i) + 6;, 
Y = ?7^y(i) + F;3(i - c/)^y(2) ^ _ (1 _ f/)A^(i) ^ ^/^^ 



(72) 
(73) 



with 



K = d[,(u^ + Fail - U)^ - 1 ) + 4(1 - Fa)il - U) 



where X^^), y(i), y(2) and ^7 are independent and X^^\ X^"^^ are distributed as X and 

y(2) distributed as 
To check that Theorem 5.1 can be apphed to the map associated to the system (72)-(73) 
first note that the form of d'^ and d'^ in (68) implies IE[6[j] = IE[6(^] = 0. To check condition 
(22) note that we have 



E 



U 



2\ 



+ E 



Fail - U) 



2A 



+ E 



il-Fa)il-U) 



2\ 



2A + 1 



< 1, 



since A > 1/2. Analogously, we have E[U^^] + E[F^(1 - U)^^] + E[(l - F^)(l - U)^^] = 
2/(2A + 1) < 1. Together, this verifies condition (22). Hence Theorem 5.1 can be applied and 
yields a unique fixed-point (£(A[,), £(A;,)) in Mfjo) x MfiO) to (|72(-(|73|. 



Theorem 6.4. Consider the Polya urn with random replacement matrix (61) with a, P ^ 
(0, 1) and a + (3 > 3/2 and the normalized numbers Xn and Yn of black balls as in 
Furthermore let (£(A'j,), £(A^)) denote the in A^f (0) x A^f (0) unique solution of [12)- 
Then, as n —)• oo, 



W). 

'73). 



Xr, 



A[j, Yn 



AL 



Proof. Analogously to the proof of Theorem 6.1 



□ 



The normal limit case. Now, we discuss the normal limit case X := a + (3 — 1 < 1/2. We 
first assume X := a + (3 — 1 < 1/2. The expansions from Lemma 6.3 now imply, as n — )■ oo 

//b(n) = Cbn + o(-v/n), ji^in) = c„n + oi^/n) (74) 



with Cb and c„ given in ( |68[ ). As in the normal limit cases in the examples in section [(TT] we 
first need asymptotic expressions for the variances. We denote the variances of and 
with (3"^(n) and (T^(n). These can be obtained from a result of Matthews and Rosenberger 
1191 for the number of draws of each color as follows: 
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Lemma 6.5. We have, as n ^ oo, 

^hin) = fi,n + o{n), (7^(n) = /4n + o(n), 

with 

(1- a) (1-/3)/ 1 



(75) 



fh fw 



(i-xy 



1 - 2A 



2A(1 + A) > 



Proof. In |19] , for the present urn model, the number A'^^ of draws within the first n draws in 
which a black ball is drawn is studied. Starting with one black ball it is established in |19j . 
as n — >■ oo, that 



E[Nn] 



1-/3 
1 - A 



n + o(n) 



Var(iV.),= (^-«)(^;^)(^ + ^^) n + o(n). 
(1-A)2(1-2A) 

As each black ball in the urn is either the first ball, or has been added after drawing a black 
ball and having success in tossing the corresponding coin, or after drawing a white ball and 
having no success in tossing the coin, we can directly link Nn to i?^: Denoting the coin 
flips after drawing black balls by (-P'j')i<j<Af„i the coin flips after drawing white balls by 
(^7)i<i<(ri-Afn) we have 

Nn n-Nn 

Using that all coin flips are independent we obtain from the law of total variance by condi- 
tioning on Nn that 



al{n)=E[Yar(^B^\Nn 
_ (l-a)(l-/3) 



+ Var E 



1 

1 - 2A 



2A(1 + A) n + o{n). 



{1-xr 

When starting with one white ball, a similar argument gives the corresponding result. 
We use the normalizations Xq := Yq := and 



fit 



o-b(n) 



Y •- 



<5"w(n) 



n > 1. 



□ 



(76) 



From the system (62)-(63) we obtain for the scaled quantities Xn, the system, for n > 1, 

. -b^^ji) + F^^xf^ + (1 - F.) ^Yj„ + e'Jn), 
(Thin) ^" crb(n) crw(n) 

. M^y(i) + f.^yP + (1 - F,) ¥^Xj„ + eUn), 



with conditions on independence and identical distributions analogously to (62)-(63). We 
have ||e[j(n)||oo, ||e^('^)||oo — ^ since the leading linear terms in the expansions (74) cancel 
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out and the error terms o{^/n) are asymptotically eliminated by the scaling of order 1/^/n. 



In view of ( 64 ) this suggests for limits X and Y of X„ and Yn respectively 

X = Vux^^'^ + F^^/r^x(2) + (1 - F„) yr^y(i), 

Y = \/c7y(i) + Fpy/l - UY^'^'^ + (1 - i^/?) \/i - ux^^\ 



(77) 
(78) 



where X^^), X^'^\ Y^^\ F^^) g^^^ ^ ^^^e independent and X^'^\ X^^) are distributed as X and 
yd), y(2) 

are distributed as Y. To the map associated to the system (^77h-(^78^ we can apply 



Theorem 5.2 The conditions (23) and (24) are trivially satisfied. Hence (A/'(0, 1),AA(0, 1)) is 



the unique fixed-point of the associated map in the space Aif{0, 1) x A4f{0, 1). 

Theorem 6.6. Consider the Polya urn with random replacement matrix with a, P ^ 



(0, 1) and a + f3 < 3/2 and the normalized numbers Xn and Yn of black balls as in (76). 
Then, as n —)• oo, 



Xn 



AA(0,1), Yn^M{Q,l). 



Proof. Analogously to the proof of Theorem |6.2 



(79) 

□ 



Remark. The case a + (3 = 3/2 differs in the error terms in (74) which then become 0{y/n). 



Since the variances in ( 75 ) get additional logarithmic factors we still obtain the system ( 77 ) 



( 78 ) and our proof technique does still apply. 



6.3 Cyclic urns 

We fix an integer m > 2 and consider an urn with balls of types 1, . . . , m. After drawing a 
ball of type j it is placed back to the urn together with a ball of type j + 1 if 1 < j < m — 1 
and together with a ball of type 1 if j = m. These urn models are called cyclic urns. Thus, 
the replacement matrix of a cyclic urn has the form 



R 



1 








1 





(80) 



We denote by Rn the number of type 1 balls after n draws when initially one ball of type j is 
contained in the urn. Our recursive approach described above yields the system of recursive 
distributional equations 



(81) 



?[3] 



— Kr + rt T , 



where, on the right hand sides, and Rj^ for j = 1, 
In uniformly distributed on {0, . . . , n — 1} and J„ = 



n 



, m, fc = 0, . . . , n — 1 are independent, 

- 1 - In. 
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We denote the imaginary unit by i and use the primitive roots of unity 



UJ := Ur, 



exp 



27ri 
m 



(82) 



with A, /i G M. Note that for 2 < m < 6 we have A < 1/2, while for m > 7 we have A > 1/2. 
Asymptotic expressions for the mean of the can be found (together with further analysis) 
in |11[ 1121 [22]. To keep this section self-contained we give an exact formula for later use: 

>[j] 



Lemma 6.7. Let Rn he the number of balls of color 1 after n draws in a cyclic urn with 
artin 



m>2 colors, starting with one ball of color j . Then, with uj = Um as in (82) we have 

Tin + l + u)^) 



E 



n+l 1 
+ - 



E 



m m ^ ^> , , r(n + 1) r(a;'^ + 1) 

fce{l,...,m}\{m/2} ' ^ > 



In particular, we have E[i2[f'] = — n + 0(1) for m = 2,3,4 and, for m > 4, as n ^ oo, 



k(j-l) 



(83) 



E 



— n + K(K,n*'')n^ + o(n^), k,- := -. 

m ^ ^ ^ ^ ^' J mr(a; + l) 



(84) 



Proof. Using the system (81), we obtain by conditioning on /„, for any 1 < i < m, 

n— 1 



n— 1 



E 



1=0 i=0 

i(E[4^ij+E[e^] 



E 



[i+i] 

n — 1 
+ 



E 



n 



Rb1 



R\ 



■n-l 



+ -E 

n 



[j+i] 



With the column vector i?„ := (i?n', . . . ,i?|l"'), the replacement matrix R in (80) and the 
identity matrix Idm this is rewritten as 

E [Rn] = (^Id„ + ^R^ E [Rn-l] = fl (idm + ^i?) E[i?o] ■ 



The eigenvalues of the replacement matrix are all m-th roots of unity uji := 



l,...,m, 



and a possible eigenbasis is vg := ^(w^ , . . . , ^)*, £ = 1, . . . ,m. Decomposing the mapping 
induced by R into the projections 7r„^ onto the respective eigenspaces we obtain 

" / 1 \ m n , ^ m-1 t/ , i , \ 



. . ^ r(w, + i)r(n + 1)' 

k=i ^ ' i=ik=i ^ ' i=\ ^- ' > ^ ' > 

Moreover, vr^,^. (E[i?o]) = and Vm = ^(1,...,1), hence the j-th component of the latter 
display implies ( 83 ) . 



The asymptotic expansion in (84) is now directly read off: Note that the roots of unity 
come in conjugate pairs tJ^ = LOm-i- If ^ is even, uJm/2 = = otherwise only Um = 1 
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is real. Combining pairs of summands for such conjugate pairs and using T{z) = T{z), we 
obtain the terms 

r(n+ 1 r(n + 1 + w^) cJ^J'-i f uj'^ r{n + 1 + uji)\ 



r(n + 1) r(a;£ + 1) r(n + 1) r(a;^ + 1) \^ r(a;£ + 1) r(n + 1) ^ ' 

By StirHng approximation the asymptotic growth order of the latter term is 3?(n'^^), hence 
the dominant asymptotic term is for the conjugate pair with largest real part, co and com-i- 



This implies (84) for m > 4. For m = 3,4: the periodic term is o(l) respectively 0(1), for 



m = 2 there is no periodic fluctuation. □ 

We do not discuss limit laws for the cases 2 < m < 6 in detail. They lead to asymptotic 
normality as has been shown with different proofs in Janson [11] and Janson |12| Example 
7.9]. These cases can be covered by our approach similarly to the normal cases in sections 



6.1| and 6.2, For 2 < m < 6, the system of limit equations is 



and Theorem 5.2 applies 



We now assume m > 7. In particular, we have the asymptotic expansion (84) of the mean 



of the -Rn' with A > 1/2. We define the normalizations 



n 



Hence, we obtain for the X„ the system 



where, on the right hand sides, and for j = 1, . . . , m, = 0, . . . , n — 1 are independent. 

To describe the asymptotic periodic behavior of the distributions of the X„ , we use the 
following related system of limit equations: 

xw = c/'^xw + (i-c/)^x[2], 
xP] ^c/"x[2l + (i-;7rx[3], 



xH = c/^xH + (1 - c/)'^x[il 
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Since uj is complex this now has to be considered as a system to solve for distributions 
. . . ,£(X[™1) on the complex plane C. The corresponding map T is a special case of 
T' in (I2TI): 



(^1, . . . 1-^ (Ti(^i, . . . ,/im,), • • • ,TjnifJ.l, ■ ■ ■ , fJ-m)) 



(86) 



for j = 1, . . . ,m with U, V^^^, . . . , yl^+^l independent, U uniformly distributed on [0, 1] and 
£(yb1) = /jj for j = 1, . . . , m and ^(yl^+i]) = 

Lemma 6.8. Let m > 7. The restriction of T to TW^'l'^i) x ••• x A^fl'^m.) has a unique 
fixed-point. 



Proof. We verify the conditions of Theorem 5.3 First note that condition (25) for our T in 

(87) 



(pl) is 



E [[7'^] Uj + E [(1 - ^7)"^] Kj+i 



J 



1, 



with Km+i ■= Ki. Since E[[/'^] = E[(l - U)'^] = (1 + a;)"^ and kj+i = ujkj we find that ([87) 
is satisfied. Condition (26) for our T is 



E +E [|(1 - Uf'^W < 1. 



Since m > 7, we have A > 1/2, thus E[|C/^'^|] + E[|(l - U) 



\2w| 



2/(1 + 2A) < 1. Hence 

□ 



Theorem 5.3 applies and implies the assertion. 

The fixed-point in Lemma |6.8| has a particularly simple structure as follows: 

Lemma 6.9. Let m > 7 and (>C(AW), . . . , /:(AH)) be the unique fixed-point in Lemma 
Furthermore let C{A) be the (unique) fixed-point of 



6.8 



X = U'^X + uj{l-UYX' in 



mT{oj + I) 



(88) 



where X , X' and U are independent, U is uniformly distributed on [0, 1] and X and X' have 
identical distributions. Then we have 



^ uj^-^A, j = 1, 



, m. 



Proof. We abbreviate 7 := 2/(mr(a; + 1)). For X, X' and U independent, U uniformly on 
[0, 1] distributed, X and X' identically distributed with EX = 7 we have 



E [U'^X + uj{l ~ U)'^X'] 



1 + UJ 



i'j + ujj) = 7, 



hence the map of probability measures on C associated to (88) maps A^2 (7) ^^^'^ itself. The 
argument of the proof of Theorem 5.3 implies that this map is a contraction on (7^2(7)) ^2)- 
Hence it has a unique fixed point /2(A). We have 

(£(A), £(tjA), . . . , £(w'"-iA)) G M^{ki) x • • • x M'^{Km) 



and , by plugging into (86), we find that this vector is a fixed-point of T. Since, by Lemma 
6.8, there is only one fixed-point of T in A^2('*i) ^ • • • x (^m) the assertion follows. □ 
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The asymptotic periodic behavior in the following theorem has already been shown almost 
surely by martingale methods in j22^ Section 4.2], see also jl21 Theorem 3.24]. Our contraction 
approach adds the characterization of C{A) as the fixed-point in (88). The proof is based on 
the complex version of the £2 metric and resembles ideas from Fill and Kapur [9], see also 
[T3l Theorem 5.3]. 



Theorem 6.10. Let m > 7 and xl^^ as in and C{A) the unique fixed-point in Lemma 
6.9. Then, for all j = 1, . . . , m, we have 

h (e^(^''^(")+2-'^s^)A)) ^0 (n ^ 00). 



(89) 



Proof. Let A^^\ . . . , Al™"! be indepe nden t random variables such that (£(A[^1), . . . , /^(At™!)) is 
the unique fixed-point as in Lemma 6.8 Set Al'""'"^] := A^^h Note that for the random variable 



within the real part in (89) with Lemma 6.9 we have 

gi(Mln(n)+2^i^)^ = n'^'uj^-^A 



n 



(90) 



The fixed-point property of the A^^ implies 

3? (n'/^Al-'l) = K (n'^t/^Al^l) + K (n'^'il - U)'^A^+^^ 
for all J = 1, . . . ,m and n > 0. We denote 

Aj{n) :=i2 (xI^lK^n'/^Abl)) 

and set Am+i(n) '■= Ai(n). Now, we assume that the Xn\ A^^'^, n > 1, 1 < j < m, I„, U 
appearing in (85) and (86) are defined on one probability space such that ixl^\?fi{n'^'A^^)) 
are optimal ^2-couplings for all n > and all 1 < j < m and such that = [nU J . Then we 
have 



(91) 



(92) 



A,(n) 



< 



X 



[j] 

In 



3f? 



n 



IL A [i] 



+ 



n 



X 



[i+i] 

J n 



n 



3? 



^Al^A - 3ft (n'/^t/'^Abl) + 3ft r^Al-'+^A - 3? (n^^C/^Al-'+il) 



+ 



Si + ^2 + ^3 + 



(93) 



First note that the summands S2 and 53 tend to zero: We have {In/ri)'^ — )■ almost surely 
by In = \nU\. Since At-^l and A^-''*'^] have finite second moments we can apply dominated 
convergence to obtain ^2, ^3 — )• as n — )• 00. 

For the estimate of the first summand 5i we abbreviate 



n 



X 



In 



3? ( ^A^] 



X 



[i+1] 



3? 



A^+il . 
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Then we have 



+ E 



+ 2E 



(94) 



Conditioning on and using that (x[-''^,3f?(A:''^Abl)) are optimal ^2-couphngs we obtain 



E 



n-l 



k=0 
n-l 

E 

fc=0 
n-l 

E 

k=0 



E 



1 /k 



n \ n 



1 /k 



2A 



E 



n \n 



2A 



n 



2A 



Analogously, we have 



. X 2A 



To bound the mixed term in (94) note that by the expansion (84) and the normalization 
(85) we have E[xif^] = ^{kju^'^) + rj{n) with rj{n) — )■ as n — )■ oo for all j = I 

In particular, we have ||rj||oo < oo. Together with E[A[-'1] = kj this implies E[Wi 
E[(/„/n)Vj-(/„)] and 



. . , m. 

[ill _ 



E 



E 



In Jn 



n n 



rj{In)rj+i{Jn 



(95) 



To show that the latter term tends to zero let e > 0. Then there exists /cq G N such that 
rj{k) < £, rj^i{k) < e for all k > k^. For all n > 2/co we obtain, by considering the event 
{ko < In < n — 1 — ko} and its complement, 



E 



<■ II II II -L 2 

n 



Hence, we obtain that the mixed term (95) tends to zero. 



Altogether, we obtain from (93) as n — )• oo that 



Aj{n) < i E 



< < 2E 



. X 2A 



-E 



X \ 2A 
Jn \ .2 

n ' 



>| 1/2 

+ o{l)\ +0(1) 



2A 



A2(/„ 



>j 1/2 

+ 0(1)} +o(l), 
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for all j = 1, . . . , m, where 



A(n) := max Aj{n). 

l<j<m 



Hence, we have 



A(n) < < 2E 



2A 



1/2 



+ 0(1) 



+ o(l). 



(96) 



Now, we obtain A(n) — )• as in the proof of Theorem 6.1 First from (96) we obtain with 
Iri/n ^ U almost surely that 



A(n) < <^ 2E 



< 



1 + 2A 



2A1 ^ 1/2 

max A2(fc) + o(l)l +o(l) 

0<K<ri— 1 I 
\ ^ 1/2 

+ 0(1) max A2(/c)+o(1) + o(1 

/ 0<K<n— 1 I 



Since A > 1/2 this implies that the sequence (A(n))„>o is bounded. We denote rj := 
sup„>oA(n) and := limsup„_^oo A(n). For any e > there exists an no > such that 
A(n) < ^ + e for all n > no. Hence, from (96) we obtain 

2Al r / T \ 2A' 



A(n) < { 2E 



ry^ + 2E 



H/n>no} 



X 1/2 

{i + ef + o{l)\ +o(l) 



With n — )• oo this implies 



1 + 2A 



Since y^2/(l + 2A) < 1 and e > is arbitrary this implies ^ = 0. 



□ 
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